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Abstract — In this paper, we presented three approaches to 
improve the design of Kite codes (newly proposed rateless codes), 
resulting in an ensemble of rate-compatible LDPC codes with 
code rates varying "continuously" from 0.1 to 0.9 for additive 
white Gaussian noise (AWGN) channels. The new ensemble 
rate-compatible LDPC codes can be constructed conveniently 
with an empirical formula. Simulation results show that, when 
applied to incremental redundancy hybrid automatic repeat 
request (IR-HARQ) system, the constructed codes (with higher 
order modulation) perform well in a wide range of signal-to- 
noise-ratios (SNRs). 

I. Introduction 

Rate-compatible low-density parity-check (RC-LDPC) 
codes, which may find applications in hybrid automatic 
repeat request (HARQ) systems, can be constructed in at 
least two ways. One way is to puncture (or extend) properly 
a well-designed mother code |l|-|6J]. Another way is to turn 
to rateless coding l7Ml0ll. resulting in rate-compatible codes 
with incremental redundancies yj. | . 

Recently, the authors proposed a new class of rateless 
codes, called Kite codes 11211 . which are a special class of 
prefix rateless low-density parity-check (LDPC) codes. Kite 
codes have the following nice properties. First, the design 
of Kite codes can be conducted progressively using a one- 
dimensional optimization algorithm. Second, the maximum- 
likelihood decoding performance of Kite codes with binary- 
phase-shift-keying (BPSK) modulation over AWGN channels 
can be analyzed. Third, the definition of Kite codes can be 
easily generalized to groups |fl3ll . In this paper, we attempt 
to improve the design of Kite codes and investigate their 
performance when combined with high-order modulations. 

The rest of this paper is organized as follows. In Section HI] 
we review the construction of the original Kite codes. The 
design of Kite codes and existing issues are discussed. In 
Section |nil we present three approaches to improve the design 
of Kite codes, resulting in good rate-compatible codes with 
rates varying continuously from 0.1 to 0.9 for additive white 
Gaussian noise (AWGN) channels. In section [IV] we present 
simulation results for the application of Kite codes to the 
HARQ system. Section [Vl concludes this paper. 

II. Review of Kite Codes 

A. Definition of Kite Codes 

An ensemble of Kite codes, denoted by /C[oo, k;p), is 
specified by its dimension k and a so-called p-sequence 



V = (Po,Pi,--- ,Ptr--) with < p t < 1 for t > 0. A 
codeword c = (v, w) G K,[oo,k;p] consists of an informa- 
tion vector v = («o,i>i,-- - , v/c-i) and a parity sequence 
w = (wq,wi, ■ ■ ■ ,Wt,---)- The parity bit at time t > 
can be computed recursively by wt — u>t-i + 2~^o<i<fc ht,i v i 
with W-i = 0, where ht,% (0 < i < k) are k independent 
realizations of a Bernoulli distributed variable with success 
probability p t . 

For convenience, the prefix code with length n of a Kite 
code is also called a Kite code and simply denoted by JC[n, k]. 
A Kite code JC[n, k] for n > k is a systematic linear code with 
r = n — k parity bits. We can write its parity-check matrix H 
as 

H=(H V ,H W ), (1) 

where H v is a matrix of size r x k that corresponds to the 
information bits, and H w is a square matrix of size r x r that 
corresponds to the parity bits. By construction, we can see 
that the sub-matrix H v is a random-like binary matrix whose 
entries are governed by the p-sequence. More specifically, 



Pr{ff M = ht ti } = 



Pt, 
1 ~Pt, 



(2) 



for t > and < i < k. In contrast, the square matrix H w is 
a dual-diagonal matrix (blanks represent zeros) 
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In the case of p t <C 1/2, a Kite code can be decoded as an 
LDPC code. 

B. Relationships between Kite Codes and Existing Codes 

A specific realization of a Kite code with finite length can be 
considered as a serially concatenated code with a systematic 
low-density generator-matrix (LDGM) code IU4I1 as the outer 
code and an accumulator as the inner code. Different from 
conventional serially concatenated codes, the inner code takes 
only the parity bits from the outer code as input bits. A specific 
Kite code is also similar to the generalized irregular repeat- 
accumulate (GeIRA) codes lTT3ll ifieh ifTsh . 

As an ensemble, Kite codes are new. To clarify this, we 
resort to the definition of a code ensemble 01911 . A binary linear 



code ensemble is a probability space (C, Q), which is specified 
by a sample space C and a probability assignment Q(C) to 
each C G C. Each sample C G C is a binary linear code, 
and the probability Q(C) is usually implicitly determined by 
a random construction method. The following examples show 
us three code ensembles with length n. 

• A code ensemble C g can be characterized by a generator 
matrix G of size k x n, where each element of G is 
drawn independently from a uniformly distributed binary 
random variable. 

• A code ensemble Ch can be characterized by a parity- 
check matrix H of size (n — k) x n, where each element 
of H is drawn independently from a uniformly distributed 
binary random variable. 

• A code ensemble C s can be characterized by a generator 
matrix G s = [Ik , P] of size kxn, where Ik is the identity 
matrix and each element of P is drawn independently 
from a uniformly distributed binary random variable. This 
ensemble can also be characterized by a random parity- 
check matrix H s = [P T , I n ~k\- 

In a strict sense, the above three ensembles are different. 
The ensemble C g has some samples with code rates less than 
k/n, and the ensemble Ch has some samples with code rates 
greater than k/n. The code ensemble C s is systematic and 
each sample has the exact code rate k/n. LT-codes (prefix 
codes with length n) have a slightly different sample space 
from that of C g but, with high probability, have low-density 
generator matrices. In contrast, the ensemble of Kite codes (of 
length n) with p t = 1/2 is the same as C s . An ensemble of 
Kite codes (of length n) with p t < 1/2 has the same sample 
space as that of C s but different probability assignment to each 
code. 

C. Design of Kite Codes 

Given a data length k. The task to optimize a Kite code is to 
select the whole p-sequence such that all the prefix codes are 
good enough. This is a multi-objective optimization problem 
and could be very complex. For simplicity, we consider only 
codes with rates greater than 0.05 and partition equally the 
code rates into 19 subintervals, that is, 



(0.05, 1]= |J (0.05^,0.05(^+1)]. 



(4) 



1<£<19 



We simply assume that the p-sequence is a step function 
of the code rates taking only 19 possibly different values 
{qe , 1 < I < 19}. That is, we enforce that p t = qe whenever 
t satisfies that the code rate k/(t + k) G (0M£, 0.05{£ + 1)]. 
Then our task is transformed into selecting the parameters 
{qg,l < £ < 19}. One approach to do this is the following 
greedy optimization algorithm. 

First, we choose qig such that the prefix code of length 
[fc/0.95j is as good as possible. Secondly, we choose qi$ with 
fixed gig such that the prefix code of length [fc / 0.90J is as 
good as possible. Thirdly, we choose qn with fixed (gig, gig) 
such that the prefix code of length [fc / 0.85J is as good as 
possible. This process continues until qi is selected. 



Let qi be the parameter to be optimized. Since the parame- 
ters qj with j > £ have been fixed and the parameters qj with 
j < £ are irrelevant to the current prefix code, the problem to 
design the current prefix code then becomes a one-dimensional 
optimization problem, which can be solved, for example, by 
the golden search method J20J. What we need to do is to make 
a choice between any two candidate parameters qg and q' t This 
can be done with the help of simulation, as illustrated in [ 12]. 



D. Existing Issues 

The above greedy optimization algorithm has been imple- 
mented in 01211 . resulting in good Kite codes. However, we 
have also noticed the following issues. 

1) In the high-rate region, Kite codes suffer from error-floors 
at BER around 10~ 4 . 

2) In the low-rate region, there exists a relatively large gap 
between the performances of the Kite codes and the 
Shannon limits. 

3) The optimized /7-sequence depends on the data length k 
and has no closed form. Therefore, we have to search 
the p-sequence for different data lengths when required. 
This inconvenience definitely limits the applications of 
the Kite codes. 

The first issue has been partially solved by either taking RS 
codes as outer codes in 01211 or inserting fixed patterns into 
the parity-check matrix in 02111 . The objective of this paper is 
to provide simple ways to overcome the above issues. 

III. Improved Design of Kite Codes 

As mentioned in the preceding section, we consider only 
code rates greater than 0.05 and partition equally the interval 
(0.05, 1] into 19 sub-intervals. Given a data length k. Define 

n * = LdbJ for 1 < ^ < 20. 

A. Constructions of the Parity-Check Matrix 

We need to construct a parity-check matrix H = (H v , H w ) 
of size (m — k) x m, where H v is a matrix of size (n% — k) x 
k corresponding to the information bits and H w is a lower 
triangular matrix with non-zero diagonal entries. Let C\n\, k] 
be the code defined by H. We then have a sequence of prefix 
codes with incremental parity bits. Equivalently, we can have 
a sequence of prefix codes with rates varying "continuously" 
from 0.05 to 1. 

To describe the algorithm more clearly, we introduce some 
definitions. Let ffW = (Hy,Hw ) be the parity-check 
matrix for the prefix code with length m. Let Pti e \-, j) 
and H^\t,-) be the j-th column and t-th row of Hi e \ 
respectively. Sometimes, we use H}) (t,j) to represent the 
entry of at the location (t,j). Given {q e , 1 < i < 19}, the 
parity-check matrix H = can be constructed as follows. 

First, the matrix H v corresponding to the information bits 
can be constructed progressively as shown in the following 
algorithm. 

Algorithm 1: (Row- weight Concentration) 

1) Initially, set H^ ^ be the empty matrix and £ = 19. 



2) While l> 1, do the following. 

a) Initialization: generate a random binary matrix 

of size (ri£ — ng + i) x k, where each entry is indepen- 
dently drawn from a Bernoulli distribution with success 
probability qi\ form the matrix corresponding to the 
information bits as 

f-( %•> )■ 

b) Row-weight concentration: 

i) Find t\ (rit+i — k < t\ < ni — k) such that 
the row Hi ,{tx,-) has the maximum Hamming 
weight, denoted by W max ; 

ii) Find to (ng+i — k < to < nt — k) such that the 
row ^ £) (^,-)hasthe minimum Hamming weight, 
denoted by W m i n ; 

iii) If W max = Wmin or W max = W min + 1, set I <r- 
i — 1 and go to Step 2); otherwise, go to the next 
step; 

iv) Find ji (0 < ji < k-1) such that H^\tx,jx) = 1 
and that the column Hi e \-,jx) has the maximum 
Hamming weight; 

v) Find jo (0 < jo < k-1) such that H ( v l \t , jo) = 
and that the column H$p(-,jo) has the minimum 
Hamming weight; 

vi) Swap Hi l \t ,jo) and i?i (*i, ji); go to Step i). 
Remarks. From the above algorithm, we can see that the 

incremental sub-matrix corresponding to the information 
vector is finally modified into a sub-matrix with rows of weight 
Wmi n or Wmin + 1. Such a modification is motivated by a 
theorem as shown in I22I1 stating that the optimal degree for 
check nodes can be selected as a concentrated distribution. 
Such a modification also excludes columns with extremely 
low weight. 

Second, the matrix H w corresponding to the parity bits is 
constructed as a random accumulator specified in the following 
algorithm. 

Algorithm 2: (Accumulator Randomization) 

1) Initially, H w is set to be the identity matrix of size (m — 
k) x (m — k). 

2) For t = 0, 1, • • ■ , nx — k — 2, do the following step by 
step. 

a) Find the maximum integer T such that the code rates 
k/(k + T) and k/(k + t + 1) falls into the same 
subinterval, say (0.05^,0.05(^+1)]; 

b) Choose uniformly at random an integer ix S [t + 1, T}; 

c) Set the ii-th component of the t-th column of H w to 
be 1, that is, set H w (ix,t) = 1. 

Remarks. The accumulator randomization approach intro- 
duces more randomness to the code such that the current parity 
bit depends randomly on previous parity bits. We also note 
that, for all £, H^' has the property that each of all columns 
but the last one has column weight 2. It is worth pointing 
out that both the row-weight concentration algorithm and the 



TABLE I 
THE p-SEQUENCES 



Code rate k/(k + t) 


pt{k = 1890) 


pt(k = 3780) 


(0.95, 1.00] 


0.0380 


0.0170 


(0.90, 0.95] 


0.0200 


0.01 10 


(0.85, 0.90] 


0.0130 


0.0050 


(0.80, 0.85] 


0.0072 


0.0039 


(0.75, 0.80] 


0.0046 


0.0023 


(0.70, 0.75] 


0.0038 


0.0020 


(0.65, 0.70] 


0.0030 


0.0016 


(0.60, 0.65] 


0.0028 


0.0013 


(0.55, 0.60] 


0.0018 


0.0010 


(0.50, 0.55] 


0.0017 


0.0009 


(0.45, 0.50] 


0.0015 


0.0007 


(0.40, 0.45] 


0.0014 


0.0007 


(0.35, 0.40] 


0.0013 


0.0006 


(0.30, 0.35] 


0.0012 


0.0006 


(0.25, 0.30] 


0.0012 


0.0005 


(0.20, 0.25] 


0.0012 


0.0005 


(0.15, 0.20] 


0.0011 


0.0005 


(0.10, 0.15] 


0.0011 


0.0004 


(0.05, 0.10] 


0.0011 


0.0004 




SNR (Signal-lo-Noise-Raflo) = 10log 1Q (1/o 2 ) [dB] 

Fig. 1. Performances of the constructed improved Kite code and Kite code 
with k = 1890. The maximum iteration number is 50. From left to right, 
the curves correspond to rates 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, and 0.9, 
respectively. 



accumulator randomization algorithm are executed in an off- 
line manner. To construct the prefix code of length ni, both of 
these two algorithms modify only the incremental ni — ri£ + i 
rows of the parity-check matrix associated with the original 
Kite code, which do not affect other rows. 

B. Greedy Optimization Algorithms 

It has been shown that, given {qg, 1 < £ < 19}, we can con- 
struct a parity-check matrix H by conducting the row-weight 
concentration Algorithm Q] and the accumulator randomization 
Algorithm [2] Hence, we can use the greedy optimization 
algorithm to construct a good parity-check matrix. We have 
designed two improved Kite codes with data length k = 1890 
and k = 3780. The p-sequences are shown in Table |T] Their 
performances are shown in Fig. [T]and Fig. [2] respectively. 

From Fig. Q] and Fig. [2] we have the following observations. 

• The improved Kite codes perform well within a wide 
range of SNRs. 




-5 5 

SNR (Signal-to-Noise-Ratio) = 10log (1/o 2 ) [dB] 



Fig. 2. Performance of the constructed improved Kite code with k = 3780. 
The maximum iteration number is 50. From left to right, the curves correspond 
to rates 0.05, 0.10, 0.15, ■ • • , 0.90, and 0.95, respectively. 
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Fig. 3. Comparisons of the empirical formula for qg with the parameters 
obtained by the simulation-based greedy optimizing algorithm. 



• In the moderate-to-high-rate region, the improved Kite 
codes perform almost as well as the original Kite codes 
in the water-fall region, while the improved Kite codes 
can effectively lower down the error-floors. 

• In the low-rate region, the improved Kite codes are better 
than the original Kite codes. For example, when the 
code rate is 0.1, the improved Kite code can achieve a 
performance gain about 1.5 dB over the original Kite code 
at BER = 1CT 5 . 

• As the data length k increases, the performance of the 
improved Kite codes can be improved. For instance, when 
the code rate is 0.2, the improved Kite code with data 
length 3780 achieves a performance gain about 0.2 dB 
over the improved Kite code with data length 1890 around 
BER = 10~ 5 . 

C. Empirical Formula 

As shown by the above two examples, the optimized p- 
sequence depends on data length k, which makes the design of 
improved Kite codes time-consuming. To accelerate the design 
of improved Kite codes, we present an empirical formula for 



the /^-sequence. To this end, we have plotted the optimized 
parameters in Table J] in Fig. [3] We then have the following 
empirical formula 



It 



1.65 



2.0 



(6) 



(1.5 - 0.05£) 6 

for 1 < i < 19. From Fig. [3] we can see that the above 
formula is well matched to the optimized p-sequence. 

IV. Applications of Kite codes to IR-HARQ 
Systems 

In summary, for an arbitrarily given data length k, we 
can generate (off-line) a systematic mother code IC[n\,k] 
with ni = L^/0-05j by constructing a parity-check matrix 
according to the following procedure. 

1) Calculate the g-sequence according to ©; 

2) Generate a semi-random parity-check matrix H = 
(H V ,H W ) according to (fJJ and ©; 

3) Modify H by performing Algorithm Q] and Algorithm |2] 
We consider to combine the Kite code with two-dimensional 

modulations. The coded bits are assumed to be modulated into 
a constellation of size 2 b by the (near-)Gray mapping and then 
transmitted over AWGN channels. The receiver first extracts 
the a posteriori probabilities of coded bits from each noisy 
coded symbol and then performs the iterative sum-product 
decoding algorithm. If the receiver can decode successfully, it 
feedbacks an "ACK"; otherwise, the transmitter produces more 
parity bits and transmits more coded signals. The receiver tries 
the decoder again with the incremental noisy coded signals. 
This process continues until the decoding is successful. To 
guarantee with high probability that the successfully decoded 
codeword is the transmitted one, we require that the receiver 
starts the decoding only after it receives nig noisy coded 
bits. With this constraint, we have never found erroneously 
decoded codewords in our simulations. That is, the decoder 
either delivers the transmitted codeword or reports a decoding 
failure. If n is the length of the code at which the decoding is 
successful, we call r\ — kb/n bits/symbol the decoding spectral 
efficiency, which is a random variable and may vary from 
frame to frame. 

The average decoding spectral efficiency of the Kite code 
of dimension k = 9450 with different high-order modulations 
for AWGN channels are shown in Fig. |4] Also shown are 
the capacity curves. Taking into account the performance 
loss of shaping gain 1.53 dB caused by conventional QAM 
constellations for large SNRs [23], we conclude that the 
improved Kite codes perform well essentially in the whole 
SNR range. 

V. Conclusion and Future Works 

In this paper, we have presented three approaches to im- 
prove the design of Kite codes. In the moderate-to-high-rate 
region, the improved Kite codes perform almost as well as 
the original Kite codes, but exhibit lower error-floors. In the 
low-rate region, the improved Kite codes are better than the 
original ones. In addition, we have presented an empirical 




Fig. 4. The average decoding spectral efficiency (at "zero" error probability) 
of the improved Kite code with data length k = 9450 over AWGN channels. 
The maximum iteration number is 50. 



formula for the p-sequence, which can be utilized to construct 
rate-compatible codes of any code rates with arbitrarily given 
data length. Simulation results show that, when combined with 
high-order modulations, the improved Kite codes perform well 
essentially in the whole range of SNRs. 

Future works include to compare Kite codes with other 
codes such as Raptor codes in terms of both the performance 
and the complexity when applied to adaptive coded modula- 
tions over noisy channels. 
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